15 research outputs found

    DIRECTOR: Generator-Classifiers For Supervised Language Modeling

    Current language models achieve low perplexity, but their generations still suffer from toxic responses, repetitiveness, and contradictions. The standard language modeling setup fails to address these issues. In this paper, we introduce a new architecture, Director, that consists of a unified generator-classifier with both a language modeling head and a classification head for each output token. Training is conducted jointly using both standard language modeling data and data labeled with desirable and undesirable sequences. Experiments in several settings show that the model has competitive training and decoding speed compared to standard language models while yielding superior results, alleviating known issues while maintaining generation quality. It also outperforms existing model-guiding approaches in terms of both accuracy and efficiency.
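    The generator-classifier idea described above, scoring each candidate next token with both a language-modeling head and a classification head, can be sketched roughly as follows. This is an illustrative reconstruction, not the authors' code; the additive combination, the `gamma` weight, and the toy vocabulary are all assumptions.

    ```python
    import math

    def director_scores(lm_logits, clf_pos_logprob, gamma=1.0):
        # Log-softmax the LM head's logits into next-token log-probabilities.
        logz = math.log(sum(math.exp(x) for x in lm_logits))
        lm_logprob = [x - logz for x in lm_logits]
        # Add the classifier head's log P(desirable | token), weighted by
        # gamma (an assumed knob; the paper's exact combination may differ).
        return [lp + gamma * c for lp, c in zip(lm_logprob, clf_pos_logprob)]

    # Toy 4-token vocabulary: token 1 is likely under the LM alone, but the
    # classifier head flags it as undesirable, so its combined score drops.
    lm_logits = [2.0, 1.0, 0.5, -1.0]
    clf_pos_logprob = [math.log(p) for p in (0.9, 0.1, 0.8, 0.5)]
    scores = director_scores(lm_logits, clf_pos_logprob)
    best_token = max(range(len(scores)), key=scores.__getitem__)
    ```

    With these toy numbers, token 0 wins because both heads favor it, while the LM-preferred but classifier-disfavored token 1 ends up scored below token 2.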

    Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation

    Current language generation models suffer from issues such as repetition, incoherence, and hallucination. An often-repeated hypothesis is that this brittleness is caused by the mismatch between the training and generation procedures, also referred to as exposure bias. In this paper, we verify this hypothesis by analyzing exposure bias from an imitation learning perspective. We show that exposure bias leads to an accumulation of errors, analyze why perplexity fails to capture this accumulation, and empirically show that this accumulation results in poor generation quality. Source code to reproduce these experiments is available at https://github.com/kushalarora/quantifying_exposure_bias. Comment: Accepted in Findings of ACL 2022.
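    The error-accumulation argument can be illustrated with a toy model: if, during free-running generation, each step derails independently with some small probability once the prefix is clean, then the chance that a length-T continuation stays on-distribution decays geometrically in T. This is our illustrative simplification, not the paper's formal imitation-learning analysis; `eps` is an assumed per-step error rate.

    ```python
    def p_on_distribution(eps, T):
        # Probability that all T free-running steps stay on-distribution,
        # assuming each step errs independently with probability eps.
        return (1.0 - eps) ** T

    # Even a small 5% per-step error rate compounds over long generations,
    # which is invisible to per-token metrics like perplexity.
    short = p_on_distribution(0.05, 5)    # ~0.77
    long = p_on_distribution(0.05, 100)   # ~0.006
    ```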

    The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

    State-of-the-art language generation models can degenerate when applied to open-ended generation problems such as text completion, story generation, or dialog modeling. This degeneration usually shows up in the form of incoherence, lack of vocabulary diversity, and self-repetition or copying from the context. In this paper, we postulate that "human-like" generations usually lie in a narrow and nearly flat entropy band, and that violations of these entropy bounds correlate with degenerate behavior. Our experiments show that this stable narrow entropy zone exists across models, tasks, and domains, and confirm the hypothesis that violations of this zone correlate with degeneration. We then use this insight to propose an entropy-aware decoding algorithm that respects these entropy bounds, resulting in less degenerate, more contextual, and "human-like" language generation in open-ended text generation settings.
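    The proposed intervention can be sketched as follows: compute the entropy of the model's next-token distribution and sample freely only while it stays inside an assumed stable band, otherwise fall back to a low-entropy choice such as greedy argmax. The band limits and the greedy fallback are our assumptions here; the paper's actual algorithm may differ in both.

    ```python
    import math, random

    def entropy(probs):
        # Shannon entropy (in nats) of a next-token distribution.
        return -sum(p * math.log(p) for p in probs if p > 0)

    def entropy_aware_pick(probs, lower, upper, rng=random.Random(0)):
        # Sample freely only while entropy stays inside the assumed stable
        # band [lower, upper]; otherwise intervene with argmax (one
        # plausible fallback, not necessarily the paper's choice).
        h = entropy(probs)
        if lower <= h <= upper:
            return rng.choices(range(len(probs)), weights=probs)[0], h
        return max(range(len(probs)), key=probs.__getitem__), h

    # A sharply peaked distribution falls below the band, so the
    # intervention returns the argmax deterministically.
    token, h = entropy_aware_pick([0.97, 0.01, 0.01, 0.01], 0.5, 1.5)
    ```

    A uniform distribution over four tokens has entropy ln 4 ≈ 1.39, inside the assumed band, so it would be sampled from rather than forced to argmax.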

    Drifts in protein and RNA as influenced by Rifampicin during seed germination in Pinus kesiya L. Royal ex-Gord.

    The effect of Rifampicin, a metabolic inhibitor, on the contents of total soluble proteins and RNA during imbibition, subsequent seed germination, and seedling emergence has been studied in the embryonal and extra-embryonal parts of Pinus kesiya.

    X-MODDES (eXtended Multi Operator Delimiter Based Data Encryption Standard)

    An algorithm is considered computationally secure if it cannot be broken with standard resources, either current or future. In this paper we introduce a new block cipher algorithm named X-MODDES. It is a unique, independent approach that uses several computational steps along with a string of operators and randomized delimiter selection based on suitable mathematical logic. X-MODDES is specially designed to produce different ciphertexts when the same key is applied to the same plaintext. Thus a new protocol has been designed to encrypt a given text, which allows a higher level of security compared to MODDES. The algorithm has been successfully implemented on a text file, a corresponding digital image file, and an audio file. We also highlight the performance of some well-known data encryption algorithms such as DES, Triple-DES, AES (Rijndael), and MODDES, and compare them with X-MODDES. Finally, it is shown that X-MODDES is among the best-performing partially symmetric key algorithms of those mentioned above, particularly for text messages of limited size.

    Macromolecular drifts associated with the effects of herbicides on the rooting of stem cuttings and rooting potential of Lantana camara L. var. aculeata

    Rooting of stem cuttings and the rooting potential of Lantana camara L. were studied, along with the changes in protein and RNA content occurring during the rooting process in response to certain herbicides. Paraquat, butachlor, CuSO4, and 2,4,5-T completely checked the rooting of the stem cuttings. Atrazine, TCA, and 2,4-D retarded the rooting response. Low as well as high doses of paraquat and butachlor, and only the higher dose of atrazine, were completely inhibitory. CuSO4 could not check the rooting potential of the plant, in contrast to its complete inhibition of the rooting of the stem cuttings. Paraquat, atrazine, butachlor, and CuSO4 significantly altered protein and RNA contents during different stages of rhizogenesis of the stem cuttings. The significance of the study is discussed in the light of controlling the vegetative reproduction of L. camara, which is a noxious weed on abandoned and arable lands.